This file describes the format of all files included in support of the publication "The 2022 Tonga Volcanic Tsunami:
Lessons from a Global Event", submitted to Science in March of 2022.


The database includes four different kinds of records. All are provided as MATLAB data files (.mat). Files are named after the station,
followed by a suffix that depends on the kind of record. The records cover the period from January 13 or 14 to January 20, 2022,
depending on the source. More information about station locations and data sources can be found in the supplementary materials of our
submission, in Tables S1 and S5.

(1) Original water level data at 1-minute resolution downloaded from various sources (EC: European Commission;
IOC: Intergovernmental Oceanographic Commission; NOAA: National Oceanic and Atmospheric Administration;
LINZ: Land Information New Zealand; NIWA: National Institute of Water and Atmospheric Research (NZ)).

All files in dataset (1) have only the station name without any suffix ("LOCATION.mat").

308 files are provided in dataset (1); see Table S1.

All files in dataset (1) include the following variables:

'h' --> 1-minute water level data as originally provided by the sources, units of mm.
'h_int' --> 1-min water level data interpolated onto an even time index, which fills missing values, units of mm.
't' --> Original time index in MATLAB datenum format (days from Jan 0, 0000).
't_int' --> Interpolated time index in MATLAB datenum format.
't_d' --> Time index converted to MATLAB datetime format ("Month-Day-Time").
't_d_int' --> Interpolated time index in MATLAB datetime format.
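
For users working outside MATLAB, the datenum values in 't' and 't_int' can be converted to calendar times. The following is a minimal
Python sketch, not part of the dataset. The function name is hypothetical, and rounding to the nearest second is an assumption that
suits 1-minute data.

```python
from datetime import datetime, timedelta

def datenum_to_datetime(datenum):
    """Convert a MATLAB serial date number (days, as a float) to a Python datetime."""
    whole_days = int(datenum)
    # Fractional day, rounded to the nearest second (assumption: 1-min data).
    seconds = round((datenum - whole_days) * 86400)
    # MATLAB datenum 1 is Jan 1 of year 0000; Python ordinal 1 is Jan 1 of
    # year 0001. The two scales differ by 366 days.
    return (datetime.fromordinal(whole_days)
            + timedelta(seconds=seconds)
            - timedelta(days=366))

# Example: a datenum of 738536.5 corresponds to 2022-01-15 12:00:00.
print(datenum_to_datetime(738536.5))
```

The 't_d' and 't_d_int' variables already hold the same information as MATLAB datetime values, so this conversion is only needed
when the datenum variables are read with other tools.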

Data from the EC and NOAA sources also include tidal prediction values and residual water levels, with the following additional variables:

'p' --> 1-minute predicted water levels as originally provided by the source, units of mm.
'p_int' --> 1-min predicted water levels interpolated onto an even time index, which fills missing values, units of mm.
'res' --> 1-minute residual water levels as originally provided by the source, units of mm.
'res_int' --> 1-min residual water level data interpolated onto an even time index, which fills missing values, units of mm.
'res_dm' --> Interpolated 1-min residual water level data with the mean value of the residual subtracted ("Residual de-meaned"), units of mm.
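
The relation between 'res_int' and 'res_dm' in dataset (1) can be illustrated with a short Python sketch. It is an illustration under
stated assumptions, not the code used to build the files: the function name is hypothetical, and the choice to skip NaN samples when
computing the mean is an assumption.

```python
import math

def demean(values):
    """Subtract the mean of the non-NaN samples from every sample.

    NaN samples are ignored when computing the mean and remain NaN in
    the output (assumption: the source does not say how gaps are handled).
    """
    finite = [v for v in values if not math.isnan(v)]
    if not finite:
        return list(values)
    mean = sum(finite) / len(finite)
    return [v - mean for v in values]

# Example with residuals in mm.
print(demean([10.0, 20.0, 30.0]))  # [-10.0, 0.0, 10.0]
```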


(2) Final residual 1-min water levels after the application of an ensemble empirical mode decomposition (EEMD) analysis to remove all tidal signals and low-frequency variability from the record.

308 files are provided in dataset (2); see Table S1.

Filenames in dataset (2) have the format "LOCATION_res.mat", and variables included are:

't_int' --> Interpolated 1-min time index in MATLAB datenum format.
't_d_int' --> Interpolated time index in MATLAB datetime format.
'res_dm' --> Interpolated 1-min residual water level data with all non-tsunami signals and low-frequency variability subtracted via EEMD ("Residual de-meaned"), units of mm.


(3) Original air pressure data downloaded from various sources (NOAA: National Oceanic and Atmospheric Administration (US); NDBC: National Data Buoy Center;
NIWA/NCD: National Institute of Water and Atmospheric Research National Climate Database (NZ); CMD: Chilean Meteorological Directorate; JMA:
Japan Meteorological Agency; ISPRA: Istituto Superiore per la Protezione e la Ricerca Ambientale (Italy/EU); BOM: Bureau of Meteorology (Australia)).

Data from CMD, BOM, and ISPRA are provided at 1-min resolution; data from NOAA and NDBC are at 6-min resolution; data from JMA and NIWA/NCD are at 10-min resolution.

137 files are provided in dataset (3); see Table S5.

Filenames in dataset (3) have the format "LOCATION_atm.mat", and variables included are:

'pr' --> Original air pressure data at the native time resolution, units of millibars (mb).
'pr_int' --> Air pressure data interpolated to a 1-min time index, units of mb.
't' --> Original time index in MATLAB datenum format (days from Jan 0, 0000).
't_int' --> Interpolated time index in MATLAB datenum format.
't_d' --> Time index converted to MATLAB datetime format ("Month-Day-Time").
't_d_int' --> Interpolated time index in MATLAB datetime format.
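
As an illustration of how a 6-min or 10-min pressure record relates to its 1-min counterpart ('pr' to 'pr_int'), the Python sketch
below resamples a series onto a 1-min grid. This README does not state which interpolation method was used, so linear interpolation
is an assumption here, and the function name is hypothetical. Times are given as integer minutes to avoid floating-point grid errors.

```python
def interp_to_one_minute(t_min, values):
    """Linearly interpolate samples onto a 1-minute grid.

    t_min  : sample times in integer minutes, strictly increasing
    values : sample values (e.g. pressure in mb), same length as t_min
    Returns (grid, interpolated), where grid runs from the first to the
    last sample time in 1-minute steps.
    """
    if len(t_min) != len(values) or len(t_min) < 2:
        raise ValueError("need at least two samples of matching length")
    grid = list(range(t_min[0], t_min[-1] + 1))
    out = []
    j = 0  # index of the sample at or before the current grid time
    for t in grid:
        while t_min[j + 1] < t:
            j += 1
        t0, t1 = t_min[j], t_min[j + 1]
        w = (t - t0) / (t1 - t0)
        out.append(values[j] + w * (values[j + 1] - values[j]))
    return grid, out

# Example: three samples at 6-min spacing, in mb.
grid, pr_int = interp_to_one_minute([0, 6, 12], [1000.0, 1006.0, 1000.0])
print(pr_int[3])  # 1003.0
```

A gap in the original record (for example, samples at minutes 0, 6, and 18) is bridged by the same linear rule, which is one way
an even time index can fill missing values.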


(4) Final residual air pressure (1-min resolution) after the application of an EEMD analysis to remove all tidal signals and low-frequency variability from the record.

137 files are provided in dataset (4); see Table S5.

Filenames in dataset (4) have the format "LOCATION_atm_pr_res.mat", and variables included are:

't_int' --> Interpolated 1-min time index in MATLAB datenum format.
't_d_int' --> Interpolated time index in MATLAB datetime format.
'res_dm' --> Interpolated 1-min residual air-pressure with all low-frequency variability subtracted via EEMD, units of mb.
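
The four naming conventions described in this file can be summarized in a small Python helper. The dictionary keys and the function
name are hypothetical labels chosen for this sketch; only the suffixes come from the descriptions above, and "LOCATION" stands in for
a real station name.

```python
# Suffix appended to the station name for each kind of record.
SUFFIXES = {
    "water_level": "",                       # dataset (1)
    "water_level_residual": "_res",          # dataset (2)
    "air_pressure": "_atm",                  # dataset (3)
    "air_pressure_residual": "_atm_pr_res",  # dataset (4)
}

def dataset_filename(station, kind):
    """Build the .mat filename for a station and record kind."""
    return f"{station}{SUFFIXES[kind]}.mat"

for kind in SUFFIXES:
    print(dataset_filename("LOCATION", kind))
```

Water level stations are listed in Table S1 and air pressure stations in Table S5, so a given station name is not expected to
appear in all four datasets.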